Ask Julius to train and test
to predict y1 in irrelevant_features.xlsx.
Ask Julius to use GridSearchCV to
find the best max_depth for the random forest regressor in (5, 10, 15, 20, 25)
find the best learning rate for the gradient boosting regressor in (0.001, 0.005, 0.01, 0.05, 0.1, 0.2)
find the best max_depth and best learning rate for the gradient boosting regressor in some sets
Tell Julius to fill in missing values
for categorical features with “None”
for numeric features with 0.
Categorical
| Feature | |
|---|---|
| Row1 | Hi |
| Row2 | Lo |
| Row3 | Med |
| Row4 | Med |
| Row5 | Lo |
Dummies
| Lo | Med | Hi | |
|---|---|---|---|
| Row1 | 0 | 0 | 1 |
| Row2 | 1 | 0 | 0 |
| Row3 | 0 | 1 | 0 |
| Row4 | 0 | 1 | 0 |
| Row5 | 1 | 0 | 0 |
Ask Julius to create a pipeline that
Ask Julius to train and test the pipeline.